Top-Frequency Parallel Coordinates Plots

نویسندگان

  • Vincent Yang
  • Harrison Nguyen
  • Norman Matloff
  • Yingkang Xie
چکیده

Parallel coordinates plotting is one of the most popular methods for multivariate visualization. However, when applied to larger data sets, there tends to be a “black screen problem,” with the screen becoming so cluttered and full that patterns are difficult or impossible to discern. Xie and Matloff (2014) proposed remedying this problem by plotting only the most frequently-appearing patterns, with frequency defined in terms of nonparametrically estimated multivariate density. This approach displays “typical” patterns, which may reveal important insights for the data. However, this remedy does not cover variables that are discrete or categorical. An alternate method, still frequency-based, is presented here for such cases. We discretize all continuous variables, retaining the discrete/categorical ones, and plot the patterns having the highest counts in the dataset. In addition, we propose some novel approaches to handling missing values in parallel coordinates settings.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Practical Application of Parallel Coordinates to Hurricane Trend Analysis

In climate studies, weather scientists are interested in discovering which environmental factors have the greatest influence on significant weather phenomena. Due to the destructiveness of recent hurricane seasons, some scientists are focusing their studies on discovering which environmental variables have the greatest impact on the intensity and frequency of seasonal storm activity using stati...

متن کامل

Project proposal: User-controlled construction of parallel coordinate plots

Parallel coordinate plots [4] are useful for visualizing multidimensional data. They can also quickly become cluttered and difficult to read, especially when displaying a large number of dimensions. Parallel coordinate plots are useful with a reasonably small set of dimensions; the order of these axes is very important [5]. There have been a variety of proposed solutions to the number and order...

متن کامل

Constructing Parallel Coordinates Plot for Problem Solving

The paper reports about authors’ investigation of applicability of a well-known technique of visualization of multivariate data, parallel coordinates plot, to different kinds of tasks. Some new methods of transformation of parallel coordinates plots are suggested. However, the primary aim of the authors is “to link tools to tasks”, i.e. to explicitly define the tasks that can be appropriately s...

متن کامل

The Parallel Coordinates Matrix

We introduce the parallel coordinates matrix (PCM) as the counterpart to the scatterplot matrix (SPLOM). Using a graph-theoretic approach, we determine a list of axis orderings such that all pairwise relations can be displayed without redundancy while each parallel-coordinates plot can be used independently to visualize all variables of the dataset. Therefore, existing axis-ordering algorithms,...

متن کامل

User-controlled construction of parallel coordinate plots

Parallel coordinate plots [7, 9] are useful for visualizing multidimensional data. There are many advantages to using parallel coordinate plots in data visualization: they can represent data across many dimensions, they are reasonably easy to interpret and do not require a great deal of expertise to use, they are not domain-specific and can be applied to almost any kind of data, and they can be...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1709.00665  شماره 

صفحات  -

تاریخ انتشار 2017